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Abstract 


The avalLablUty of big data on human activity Is currently changing the way we look at our 
surroundings. With the high penetration of mobile phones, nearlg everyone Is already carrying 
a hlgh-preclslon sensor providing an opportunity to monitor and analyze the dynamics of human 
movement on unprecedented scales. In this article, we present a technlgue and visualization tool 
which uses aggregated activity measures of mobile networks to gain Information about human activity 
shaping the structure of the cities. Based on ten months of mobile network data, activity patterns 
can be compared through time and space to unravel the "city's pulse" as seen through the specific 
signatures of different locations. Furthermore, the tool allows classifying the neighborhoods Into 
functional clusters based on the timeline of human actlvltg, providing valuable Insights on the actual 
land use patterns within the city. This way, the approach and the tool provide new ways of looking 
at the city structure from historical perspective and potentially also In real-time based on dynamic 
up-to-date records of human behavior. The online tool presents results for four global cities: New 
York, London, Hong Kong and Los Angeles. 

Increasing dynamics of urban transformation In a rapidly developing world calls for adaptive urban 
planning. One of the central guestlons for city planners Is to know the function of different neigh¬ 
borhoods, which Is crucial for understanding their needs. Traditionally, cities maintain a database of 
land-use classification, which Is based on official records. While such data is generally of very high 
resolution, its nature limits the ability to follow changes dynamically. But today's challenges come 
together with new solutions to them: the use of anonymized mobile phone data enables dynamic mea¬ 
surement of human activity and sensing the unigue signatures of each neighborhood of a city based 
on that. Exploiting these new possibilities, recent years saw increasing usage of mobile phone data 
for better understanding people's presence around the city [1, 2], human mobility [3, 4, 5], structure of 
the city [6, 7, 8] and many other applications, including regional delineation [9, 10]. 
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Figure 1: Identifying long-term trends and events in mobile network activity in the Greater London 
area. Left: city-wide aggregated data shows a steady increase in data traffic; also notice the large 
drop around Christmas. Middle: Concentrating on the Merton district, the effect of the Wimbledon 
Tennis Championship can be seen in network activity. Right: Looking at the residual data, special 
events can be identified by large peaks. The example given here is the district containing Wemb- 
leyStadium which is a major sports venue in London; the timeline is focused on the date of the 2013 
UEFA Champions League Final. 

The presented tool [11], developed in collaboration between MIT SENSEable City Laboratory and 
Ericsson, allows the comparative analysis of city structure with the use of aggregated mobile network 
activity data in four global cities: New York, London, Hong Kong and Los Angeles. Further research 
insights coming from such analysis are presented in [12]. The tool uses aggregated data collected 
between April 2013 and January 2014 in the four cities provided by mobile network operators with 
representative market shares. The data includes measurements of human phone activity aggregated 
on the level of mobile antennas within 15-minute time intervals. These include the number of calls 
placed and text messages sent, the amount of data downloaded and uploaded, and the number of data 
reguests (number of individual times data transfer was initiated) for each antenna in each 15-minute 
time window. This way, the aggregated data does not include any sensitive customer information, 
but provides enough detail about the typical usage patterns on the scale of small neighborhoods. 
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Figure 2: Comparing the activity profile of the typical week, there is a significant distinction between a 
central business area (downtown Los Angeles, the red curve) and a residential neighborhood (Florence, 
in blue). 


We proceed by further spatial aggregation so as to account for the variation of antenna-level activity 
volumes due to technical constraints of cellular network operation. For most of the analysis, we 
normalize the resulting time series of different locations; we expect that the most information can be 
gained from comparing the shape of time series, i.e. how activity is distributed over a day or week. 
Looking at the spatial distribution of activity volumes is possible in a separate usage mode in the 
tool. 

Possible usage of the data is demonstrated via three possible usage modes in the online tool [11]: 
time series, spatial clusters and density maps. The first, time series mode facilitates the exploration 
of human activity measures on the level of city districts; possible usage is illustrated in Figs. 1, 2 
and 3. In this mode, one can look at individual time series throughout the measurement period making 
possible to identify long-term trends and special events, and weekly averages enabling the comparison 
of typical activity patterns in different neighborhoods or in different cities. Notable features in long¬ 
term time series include the steady increase in data traffic, the distinct effect of holidays, and specific 
signatures of some important events like the Wimbledon tennis championship or football matches in 
London. Looking at typical week time series, patterns representative of cities' business centers and 
residential areas can be easily identified. Apart from looking at the long-term time series of activity 
and weekly averages, it is also possible to inspect the residual activity, i.e. difference of activity on a 
specific week from the average. Special events can also be noticed by peaks in these time series as 
illustrated by the example of Wembley Stadium in Fig. 1. 

The second, spatial clusters mode of the online tool allows the interactive exploration of results 
on decomposing the cities into functional clusters based on the mobile network activity time series. 
This analysis was performed on the typical week time series of network activity aggregated into a 
regular grid. Each of the functional clusters can then be represented with a cluster-wide average 
time series and a map displaying the grid pixels which belong to it. As an example, the functional 
clusters of New York City are displayed in Fig. 4. Looking at the average cluster time series, we can 
classify them as e.g. core business, where activity is especially high in normal working hours and low 
otherwise; residential, where the peak of daily activities is in the evenings; leisure and parks, where 
activity has noticeable peaks in weekends during the day. It is also possible to compare clusters from 
different cities. For example, in Fig. 4, we show comparison of clusters identified as core business 
and residential in New York and Hong Kong. It is remarkable that the activity signatures of the 
two business clusters seem much more similar than the signatures of the two residential clusters. A 
more detailed analysis of results of clustering, including in-depth discussion of the interpretation of 
the clusters, comparison of results with traditional, census-based data, and a systematic analysis of 
differences among the four cities is presented in our related scientific publication [12]. 

The third mode of the online tool presents the spatial distribution of activity volumes in the cities. 
It also allows inspecting the ratio of volumes of different activity types, giving an overview of which 
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Figure 3: Comparing city-wide aggregated activity profiles In three cities. Apart from showing a 
high degree of similarity corresponding to the natural circadian rhythm of humans, there are notable 
differences, e.g. text message activity peaks In the morning In Flong Kong, In the evening In New 
York and midday In London or the abrupt decrease of data traffic In London when contrasting the 
evening peaks In the other two cities. We speculate that the latter observatlonls the effect of cellular 
data traffic being especially expensive In London, prompting people to switch to much cheaper wlfl 
networks when at home. 
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Figure 4: Functional clusters formed on the similarity of typical activity time series uncover Important 
structural characteristics of cities. Left: Functional clusters found In New York; Middle: typical time 
series of the clusters; Right: Comparing business and residential clusters In New York with the same 
In Flong Kong. 
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Figure 5: Spatial distribution of average volume of calls placed In Flong Kong. 


type of activity Is preferred In different parts of the cities. An example Is displayed In Fig. 5, showing 
the spatial distribution of call volumes In Flong Kong. 

We believe the presented tool to be a valuable supplement for urban planning decision support 
as well as for providing Important Insights on the way people use the city for researchers, urban 
stakeholders and general public. 
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